IRBIS: a systematic search for conserved complementarity.
Identifieur interne : 001C30 ( Main/Exploration ); précédent : 001C29; suivant : 001C31IRBIS: a systematic search for conserved complementarity.
Auteurs : Dmitri D. Pervouchine [Russie]Source :
- RNA (New York, N.Y.) [ 1469-9001 ] ; 2014.
Descripteurs français
- KwdFr :
- Animaux, Caenorhabditis elegans (génétique), Données de séquences moléculaires, Drosophila melanogaster (génétique), Exons (génétique), Gènes (génétique), Humains, Introns (génétique), Logiciel, Petit ARN nucléolaire (génétique), Similitude de séquences d'acides nucléiques, Séquence conservée (génétique), Séquence nucléotidique, Épissage des ARN (génétique).
- MESH :
English descriptors
- KwdEn :
- Animals, Base Sequence, Caenorhabditis elegans (genetics), Conserved Sequence (genetics), Drosophila melanogaster (genetics), Exons (genetics), Genes (genetics), Humans, Introns (genetics), Molecular Sequence Data, RNA Splicing (genetics), RNA, Small Nucleolar (genetics), Sequence Homology, Nucleic Acid, Software.
- MESH :
- chemical , genetics : RNA, Small Nucleolar.
- genetics : Caenorhabditis elegans, Conserved Sequence, Drosophila melanogaster, Exons, Genes, Introns, RNA Splicing.
- Animals, Base Sequence, Humans, Molecular Sequence Data, Sequence Homology, Nucleic Acid, Software.
Abstract
IRBIS is a computational pipeline for detecting conserved complementary regions in unaligned orthologous sequences. Unlike other methods, it follows the "first-fold-then-align" principle in which all possible combinations of complementary k-mers are searched for simultaneous conservation. The novel trimming procedure reduces the size of the search space and improves the performance to the point where large-scale analyses of intra- and intermolecular RNA-RNA interactions become possible. In this article, I provide a rigorous description of the method, benchmarking on simulated and real data, and a set of stringent predictions of intramolecular RNA structure in placental mammals, drosophilids, and nematodes. I discuss two particular cases of long-range RNA structures that are likely to have a causal effect on single- and multiple-exon skipping, one in the mammalian gene Dystonin and the other in the insect gene Ca-α1D. In Dystonin, one of the two complementary boxes contains a binding site of Rbfox protein similar to one recently described in Enah gene. I also report that snoRNAs and long noncoding RNAs (lncRNAs) have a high capacity of base-pairing to introns of protein-coding genes, suggesting possible involvement of these transcripts in splicing regulation. I also find that conserved sequences that occur equally likely on both strands of DNA (e.g., transcription factor binding sites) contribute strongly to the false-discovery rate and, therefore, would confound every such analysis. IRBIS is an open-source software that is available at http://genome.crg.es/~dmitri/irbis/.
DOI: 10.1261/rna.045088.114
PubMed: 25142064
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: 001875
- to stream PubMed, to step Curation: 001875
- to stream PubMed, to step Checkpoint: 001884
- to stream Ncbi, to step Merge: 000E72
- to stream Ncbi, to step Curation: 000E72
- to stream Ncbi, to step Checkpoint: 000E72
- to stream Main, to step Merge: 001C43
- to stream Main, to step Curation: 001C30
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">IRBIS: a systematic search for conserved complementarity.</title>
<author><name sortKey="Pervouchine, Dmitri D" sort="Pervouchine, Dmitri D" uniqKey="Pervouchine D" first="Dmitri D" last="Pervouchine">Dmitri D. Pervouchine</name>
<affiliation wicri:level="4"><nlm:affiliation>Centre for Genomic Regulation and UPF, Barcelona 08003, Spain Faculty of Bioengineering and Bioinformatics, Moscow State University, 119992 Moscow, Russia dp@crg.eu.</nlm:affiliation>
<orgName type="university">Université d'État de Moscou</orgName>
<country>Russie</country>
<placeName><settlement type="city">Moscou</settlement>
<region>District fédéral central</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2014">2014</date>
<idno type="RBID">pubmed:25142064</idno>
<idno type="pmid">25142064</idno>
<idno type="doi">10.1261/rna.045088.114</idno>
<idno type="wicri:Area/PubMed/Corpus">001875</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001875</idno>
<idno type="wicri:Area/PubMed/Curation">001875</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001875</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001884</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001884</idno>
<idno type="wicri:Area/Ncbi/Merge">000E72</idno>
<idno type="wicri:Area/Ncbi/Curation">000E72</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000E72</idno>
<idno type="wicri:Area/Main/Merge">001C43</idno>
<idno type="wicri:Area/Main/Curation">001C30</idno>
<idno type="wicri:Area/Main/Exploration">001C30</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">IRBIS: a systematic search for conserved complementarity.</title>
<author><name sortKey="Pervouchine, Dmitri D" sort="Pervouchine, Dmitri D" uniqKey="Pervouchine D" first="Dmitri D" last="Pervouchine">Dmitri D. Pervouchine</name>
<affiliation wicri:level="4"><nlm:affiliation>Centre for Genomic Regulation and UPF, Barcelona 08003, Spain Faculty of Bioengineering and Bioinformatics, Moscow State University, 119992 Moscow, Russia dp@crg.eu.</nlm:affiliation>
<orgName type="university">Université d'État de Moscou</orgName>
<country>Russie</country>
<placeName><settlement type="city">Moscou</settlement>
<region>District fédéral central</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">RNA (New York, N.Y.)</title>
<idno type="eISSN">1469-9001</idno>
<imprint><date when="2014" type="published">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Animals</term>
<term>Base Sequence</term>
<term>Caenorhabditis elegans (genetics)</term>
<term>Conserved Sequence (genetics)</term>
<term>Drosophila melanogaster (genetics)</term>
<term>Exons (genetics)</term>
<term>Genes (genetics)</term>
<term>Humans</term>
<term>Introns (genetics)</term>
<term>Molecular Sequence Data</term>
<term>RNA Splicing (genetics)</term>
<term>RNA, Small Nucleolar (genetics)</term>
<term>Sequence Homology, Nucleic Acid</term>
<term>Software</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Animaux</term>
<term>Caenorhabditis elegans (génétique)</term>
<term>Données de séquences moléculaires</term>
<term>Drosophila melanogaster (génétique)</term>
<term>Exons (génétique)</term>
<term>Gènes (génétique)</term>
<term>Humains</term>
<term>Introns (génétique)</term>
<term>Logiciel</term>
<term>Petit ARN nucléolaire (génétique)</term>
<term>Similitude de séquences d'acides nucléiques</term>
<term>Séquence conservée (génétique)</term>
<term>Séquence nucléotidique</term>
<term>Épissage des ARN (génétique)</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="genetics" xml:lang="en"><term>RNA, Small Nucleolar</term>
</keywords>
<keywords scheme="MESH" qualifier="genetics" xml:lang="en"><term>Caenorhabditis elegans</term>
<term>Conserved Sequence</term>
<term>Drosophila melanogaster</term>
<term>Exons</term>
<term>Genes</term>
<term>Introns</term>
<term>RNA Splicing</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr"><term>Caenorhabditis elegans</term>
<term>Drosophila melanogaster</term>
<term>Exons</term>
<term>Gènes</term>
<term>Introns</term>
<term>Petit ARN nucléolaire</term>
<term>Séquence conservée</term>
<term>Épissage des ARN</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Animals</term>
<term>Base Sequence</term>
<term>Humans</term>
<term>Molecular Sequence Data</term>
<term>Sequence Homology, Nucleic Acid</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Animaux</term>
<term>Données de séquences moléculaires</term>
<term>Humains</term>
<term>Logiciel</term>
<term>Similitude de séquences d'acides nucléiques</term>
<term>Séquence nucléotidique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">IRBIS is a computational pipeline for detecting conserved complementary regions in unaligned orthologous sequences. Unlike other methods, it follows the "first-fold-then-align" principle in which all possible combinations of complementary k-mers are searched for simultaneous conservation. The novel trimming procedure reduces the size of the search space and improves the performance to the point where large-scale analyses of intra- and intermolecular RNA-RNA interactions become possible. In this article, I provide a rigorous description of the method, benchmarking on simulated and real data, and a set of stringent predictions of intramolecular RNA structure in placental mammals, drosophilids, and nematodes. I discuss two particular cases of long-range RNA structures that are likely to have a causal effect on single- and multiple-exon skipping, one in the mammalian gene Dystonin and the other in the insect gene Ca-α1D. In Dystonin, one of the two complementary boxes contains a binding site of Rbfox protein similar to one recently described in Enah gene. I also report that snoRNAs and long noncoding RNAs (lncRNAs) have a high capacity of base-pairing to introns of protein-coding genes, suggesting possible involvement of these transcripts in splicing regulation. I also find that conserved sequences that occur equally likely on both strands of DNA (e.g., transcription factor binding sites) contribute strongly to the false-discovery rate and, therefore, would confound every such analysis. IRBIS is an open-source software that is available at http://genome.crg.es/~dmitri/irbis/. </div>
</front>
</TEI>
<affiliations><list><country><li>Russie</li>
</country>
<region><li>District fédéral central</li>
</region>
<settlement><li>Moscou</li>
</settlement>
<orgName><li>Université d'État de Moscou</li>
</orgName>
</list>
<tree><country name="Russie"><region name="District fédéral central"><name sortKey="Pervouchine, Dmitri D" sort="Pervouchine, Dmitri D" uniqKey="Pervouchine D" first="Dmitri D" last="Pervouchine">Dmitri D. Pervouchine</name>
</region>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001C30 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001C30 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= Main |étape= Exploration |type= RBID |clé= pubmed:25142064 |texte= IRBIS: a systematic search for conserved complementarity. }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i -Sk "pubmed:25142064" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd \ | NlmPubMed2Wicri -a MersV1
This area was generated with Dilib version V0.6.33. |